Microsoft Word - manuscript
نویسندگان
چکیده
In recent years, there has been an increasing interest in planted (l, d) motif search (PMS) with applications to discovering significant segments in biological sequences. However, there has been little discussion about PMS over large alphabets. This paper focuses on motif stem search (MSS), which is recently introduced to search motifs on large-alphabet inputs. A motif stem is an l-length string with some wildcards. The goal of the MSS problem is to find a set of stems that represents a superset of all (l, d) motifs present in the input sequences, and the superset is expected to be as small as possible. The three main contributions of this paper are as follows: (1) We build motif stem representation more precisely by using regular expressions. (2) We give a method for generating all possible motif stems without redundant wildcards. (3) We propose an efficient exact algorithm, called StemFinder, for solving the MSS problem. Compared with the previous algorithms, StemFinder runs much faster and first solves the (17, 8), (19, 9) and (21, 10) challenging instances on protein sequences; moreover, StemFinder reports fewer stems which represent a smaller superset of all (l, d) motifs. StemFinder is freely available at http://sites.google.com/site/feqond/stemfinder.
منابع مشابه
Author's response to reviews Title:Few additional genetic mutations accumulate during metastatic progression in high-grade serous ovarian cancer Authors:
1. Line NumberingPlease revise your manuscript to include line and page numbers. Authors are asked to ensure that line numbering is included in the main text file of their manuscript at the time of submission to facilitate peer-review. Once a manuscript has been accepted, line numbering should be removed from the manuscript before publication. For authors submitting their manuscript in Microsof...
متن کاملMicrosoft Word - JBC manuscript 09-1-13
Background: The functional importance of C2 insert containing isoform of nonmuscle myosin II-C is not known.
متن کاملMicrosoft Word - manuscript rev2 v2
Background: -synuclein is an aggregation-prone protein which reconfigures more slowly under aggregating conditions. Results: Curcumin binds to monomeric synuclein, prevents aggregation and increases the reconfiguration rate, particularly at high temperatures. Conclusion: Curcumin rescues the protein from aggregation by making the protein more diffusive. Significance: The search for aggregatio...
متن کاملAvoiding ethical temptations
In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier's archiving and manuscript policies are encouraged to visit:
متن کاملSome problems, I care most
In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier's archiving and manuscript policies are encouraged to visit:
متن کاملMulti-armed spirals and multi-pairs antispirals in spatial rock–paper–scissors games
In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier's archiving and manuscript policies are encouraged to visit:
متن کامل